new user
Breaking the Cold-Start Barrier: Reinforcement Learning with Double and Dueling DQNs
Recommender systems struggle to provide accurate suggestions to new users with limited interaction history, a challenge known as the cold-user problem. This paper proposes a reinforcement learning approach using Double and Dueling Deep Q-Networks (DQN) to dynamically learn user preferences from sparse feedback, enhancing recommendation accuracy without relying on sensitive demographic data. By integrating these advanced DQN variants with a matrix factorization model, we achieve superior performance on a large e-commerce dataset compared to traditional methods like popularity-based and active learning strategies. Experimental results show that our method, particularly Dueling DQN, reduces Root Mean Square Error (RMSE) for cold users, offering an effective solution for privacy-constrained environments.
- Research Report > Experimental Study (0.69)
- Research Report > New Finding (0.67)
Get 60% off a lifetime of flights and hotels for life
If you've ever booked a hotel or flight only to see the price drop the very next day, or have been putting your travel dreams on hold altogether because of the prices, your worries are over. OneAir is a members-only, all-in-one AI-powered travel platform built for modern travelers. Their intelligent search engine scans and tracks millions of flight and hotel deals from your home airport to leading global destinations. Members receive instant mobile and email alerts as soon as travel prices drop. A lifetime subscription is now available to new users for just 59.99 when you use code FLY30 at checkout.
ChatGPT gained one million new users in an hour today
OpenAI has been doubling its audience for ChatGPT at a rapid rate, and the addition of its latest image generation feature has increased the AI assistant's popularity. Today, CEO Sam Altman posted to X that the service "added one million users in the last hour," calling it a "biblical demand" for the image generation. When the company announced the rollout of image generation in ChatGPT last week, the tool was meant to be available to all user tiers. However, the high degree of interest meant that access for free users was walked back. Now, the company is reporting "issues with new signups," which has been its status for more than a day.
When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning
Dong, Yijiang River, Hu, Tiancheng, Liu, Yinhong, Üstün, Ahmet, Collier, Nigel
While Reinforcement Learning from Human Feedback (RLHF) is widely used to align Large Language Models (LLMs) with human preferences, it typically assumes homogeneous preferences across users, overlooking diverse human values and minority viewpoints. Although personalized preference learning addresses this by tailoring separate preferences for individual users, the field lacks standardized methods to assess its effectiveness. We present a multi-faceted evaluation framework that measures not only performance but also fairness, unintended effects, and adaptability across varying levels of preference divergence. Through extensive experiments comparing eight personalization methods across three preference datasets, we demonstrate that performance differences between methods could reach 36% when users strongly disagree, and personalization can introduce up to 20% safety misalignment. These findings highlight the critical need for holistic evaluation approaches to advance the development of more effective and inclusive preference learning systems.
- Europe > Austria > Vienna (0.14)
- Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- (13 more...)
DGSense: A Domain Generalization Framework for Wireless Sensing
Zhou, Rui, Cheng, Yu, Li, Songlin, Zhang, Hongwang, Liu, Chenxu
Wireless sensing is of great benefits to our daily lives. However, wireless signals are sensitive to the surroundings. Various factors, e.g. environments, locations, and individuals, may induce extra impact on wireless propagation. Such a change can be regarded as a domain, in which the data distribution shifts. A vast majority of the sensing schemes are learning-based. They are dependent on the training domains, resulting in performance degradation in unseen domains. Researchers have proposed various solutions to address this issue. But these solutions leverage either semi-supervised or unsupervised domain adaptation techniques. They still require some data in the target domains and do not perform well in unseen domains. In this paper, we propose a domain generalization framework DGSense, to eliminate the domain dependence problem in wireless sensing. The framework is a general solution working across diverse sensing tasks and wireless technologies. Once the sensing model is built, it can generalize to unseen domains without any data from the target domain. To achieve the goal, we first increase the diversity of the training set by a virtual data generator, and then extract the domain independent features via episodic training between the main feature extractor and the domain feature extractors. The feature extractors employ a pre-trained Residual Network (ResNet) with an attention mechanism for spatial features, and a 1D Convolutional Neural Network (1DCNN) for temporal features. To demonstrate the effectiveness and generality of DGSense, we evaluated on WiFi gesture recognition, Millimeter Wave (mmWave) activity recognition, and acoustic fall detection. All the systems exhibited high generalization capability to unseen domains, including new users, locations, and environments, free of new data and retraining.
- Health & Medicine (0.49)
- Information Technology (0.46)
The Download: Bluesky's impersonators, and shaking up the economy with ChatGPT
Like many others, I recently joined Bluesky. On Thanksgiving, I was delighted to see a private message from a fellow AI reporter, Will Knight from Wired. Or at least that's who I thought I was talking to. I became suspicious when the person claiming to be Knight said they were from Miami, when Knight is, in fact, from the UK. The account handle was almost identical to the real Will Knight's handle, and used his profile photo. Then more messages started to appear.
How to Get Started on Bluesky
The social media app Bluesky just reached the top of the free download charts for Apple's app store in the United States, making it--for the moment, at least--more popular than Meta's Threads and OpenAI's ChatGPT. The decentralized social media platform has received a fresh influx of users over the last week as more X users sour on Elon Musk's political ambitions and abandon his social media platform for an alternative. Over a million new users have joined Bluesky since the 2024 US presidential election on November 5, which was shaped by Musk's influence. First launched in 2019 as a project within Twitter, Bluesky gained independence from the company before Musk's acquisition and its subsequent name change. Bluesky also captured the attention of some ex-tweeters back in 2023, when new users were only able to sign up through an invite system.
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.56)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.56)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)
Minimizing Live Experiments in Recommender Systems: User Simulation to Evaluate Preference Elicitation Policies
Hsu, Chih-Wei, Mladenov, Martin, Meshi, Ofer, Pine, James, Pham, Hubert, Li, Shane, Liang, Xujian, Polishko, Anton, Yang, Li, Scheetz, Ben, Boutilier, Craig
Evaluation of policies in recommender systems typically involves A/B testing using live experiments on real users to assess a new policy's impact on relevant metrics. This ``gold standard'' comes at a high cost, however, in terms of cycle time, user cost, and potential user retention. In developing policies for ``onboarding'' new users, these costs can be especially problematic, since on-boarding occurs only once. In this work, we describe a simulation methodology used to augment (and reduce) the use of live experiments. We illustrate its deployment for the evaluation of ``preference elicitation'' algorithms used to onboard new users of the YouTube Music platform. By developing counterfactually robust user behavior models, and a simulation service that couples such models with production infrastructure, we are able to test new algorithms in a way that reliably predicts their performance on key metrics when deployed live. We describe our domain, our simulation models and platform, results of experiments and deployment, and suggest future steps needed to further realistic simulation as a powerful complement to live experiments.
- Oceania > Australia > Victoria > Melbourne (0.14)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Asia > Taiwan > Taiwan Province > Taipei (0.04)
- (18 more...)
- Media (0.68)
- Information Technology > Services (0.67)
- Information Technology > Modeling & Simulation (1.00)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Improving Emotion Recognition Accuracy with Personalized Clustering
Gutierrez-Martin, Laura, Ongil, Celia Lopez, Lanza-Gutierrez, Jose M., Calero, Jose A. Miranda
Emotion recognition through artificial intelligence and smart sensing of physical and physiological signals (Affective Computing) is achieving very interesting results in terms of accuracy, inference times, and user-independent models. In this sense, there are applications related to the safety and well-being of people (sexual aggressions, gender-based violence, children and elderly abuse, mental health, etc.) that require even more improvements. Emotion detection should be done with fast, discrete, and non-luxurious systems working in real-time and real life (wearable devices, wireless communications, battery-powered). Furthermore, emotional reactions to violence are not equal in all people. Then, large general models cannot be applied to a multiuser system for people protection, and customized and simple AI models would be welcomed by health and social workers and law enforcement agents. These customized models will be applicable to clusters of subjects sharing similarities in their emotional reactions to external stimuli. This customization requires several steps: creating clusters of subjects with similar behaviors, creating AI models for every cluster, continually updating these models with new data, and enrolling new subjects in clusters when required. A methodology for clustering data compiled (physical and physiological data, together with emotional labels) is presented in this work, as well as the method for including new subjects once the AI model is generated. Experimental results demonstrate an improvement of 4% in accuracy and 3% in f1-score w.r.t. the general model, along with a 14% reduction in variability.
- Europe > Spain > Galicia > Madrid (0.05)
- Europe > Switzerland > Vaud > Lausanne (0.04)
- Europe > Montenegro (0.04)
- Information Technology > Artificial Intelligence > Cognitive Science > Emotion (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)